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Abstract 

The spherical Sherrington-Kirkpatrick model is a spherical mean field model for spin glass. 
We consider the fluctuations of the free energy at arbitrary non-critical temperature for the 
2-spin model with no magnetic field. We show that in the high temperature regime the law of 
the fluctuations converges to the Gaussian distribution just like in the Sherrington-Kirtkpatrick 
model. We show, on the other hand, that the law of the fluctuations is given by the GOE 
Tracy-Widom distribution in the low temperature regime. The orders of the fluctuations are 
markedly different in these two regimes. A universality of the limit law is also proved. 


1 Introduction 

The spherical Sherrington-Kirkpatrick (SSK) model (with 2-spin interaction and no magnetic field) 
is dehned by the Hamiltonian 

HNi(T) = X d-1) 

where the spin variables <t = (ui, • • • he on the sphere ||<t|P = (•, •) 

denotes the Euclidean inner product. Here Jij = Jjj, 1 < * < j < A, are independent identically 
distributed random variables, representing the disorder of the system, and J = with 

Jii = 0. We assume that J 12 has mean 0 and variance 1. The free energy at inverse temperature /3 
is dehned by 

FN = FM = ^\ogZN, Zn= [ ( 1 . 2 ) 

where doj^ is the normalized uniform measure on the sphere Sn-i = {cr £ : ||<t|P = N}. Note 

that since Jij are random, Tjv is a random variable. The subject of this paper is the huctuations 
of F/v as Af —>■ 00 . 
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The usual Sherrington-Kirkpatrick (SK) model is a mean field version of the Edwards-Anderson 
model of spin glass, and is given by the same Hamiltonian as (1.1) but with the condition that the 
spin variables are on a lattice instead of the sphere: cr £ {—1,1}^. (For this case many literatnres 
use a different convention that the Hamiltonian is divided by 2.) The partition function is defined 
as Zn = X^cre{-i 1 }^ Among the numerous existing results on the SK model (see, for 

example, [49, 50, 37]), we here review a few about the free energy that are relevant to this paper. 
The non-random limit of the free energy, limAr^oo F'n, for the SK model was first predicted by Parisi 
[39] in the more general setting of p-spin interactions in the presence of external magnetic field. 
The Parisi formula was rigorously proved by Talagrand in his famous 2006 paper [48] in which he 
proved the convergence of the expectation to the Parisi formnla when the disorder random variables 
Jij are Gaussian. The universality of the limit of TV independent of J is proved under the finite 
third moment condition, with mean 0 and variance 1 by Carmona and Hn [19], which improved 
the previous result by Guerra and Toninelli [28] for symmetric random variables with hnite fourth 
moment. The Parisi formnla is implicit and is given in terms of a variational problem. For a recent 
study on this variational problem, see [5]. An important feature here is the existence of the critical 
temperature f5c = \- The cardinality of the support of the measure that underlies in the Parisi 
formula changes at /3 = /3c. The phase transition is also understood by the fact that the difference 
between the quenched disorder free energy and the annealed free energy, ^E[log Zat] — ^ logE[Z 7 v], 
tends to zero as A^ ^ oo in the high temperature regime, /3 < /3c, but does not tend to zero in the 
low temperature regime, (3 > j3c (see [1]). 

The fluctuations of the free energy for the SK model in the high temperature regime was studied 
by Aizenman, Lebowtiz, and Ruelle [1]. They showed that if the disorder random variables Jij, 
3 < i < j < N, are independent Gaussian with mean zero and variance 1, AA(0,1), then 

A(Fjv-(log2 + /32)) ^Ar(-la,a), (1.3) 

where 

a =-^log(l - 4/3^) - 2/3^, (1.4) 

and the convergence is in distribution as —)• oo (see also Section 11.4 in [50].) It was also shown in 
[1] that for non-Gaussian disorder, the same limit theorem holds with some changes on the formula 
of a. However, a limit theorem for the fluctuations in the low temperature regime still remains as 
an open question. The limit theorem is not known even for the zero temperature case. 


The SSK model was introduced by Kosterlitz, Thouless, and Jones [31] as a model that is easier 
to analyze than the SK model. Indeed in their paper, the anthors evaluated the limit of the free 
energy explicitly though a rigorous proof was not supplied. The analogue of Parisi formula for the 
SSK model was obtained by Crisanti and Sommers [24]. The Parisi formula for the SSK model 
was later proved rigorously by Talagrand [47] immediately after he proved the formula for the SK 
model. The Parisi formula can be evaluated explicitly for the Hamiltonian (1.1) (for the case with 
2-spin interactions without magnetic field) [38] and the resnlting formula is same as one obtained 
in [31]: 


Fn ^ F(/3) = 


(1.5) 


/32 if0</3<l/2, 

2/3 _ log(2/3K3/2 if ^>1/2 

in expectation and also in distribution. For a general class of random variables Jij, a corresponding 
result for the limiting free energy F(/3) was obtained in [29]. The formula in [29] was given in terms 
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of R-transform and one can check that it is same as the one in Definition 2.13 below. Note that 
F(/3) in (1.5) is but not at j3c = the critical temperature is same as that of the SK model. 
The third-order transition also holds for SSK model with general random variables Jij as one can 
see in Definition 2.13. 

We remark that paper [10] studied the so-called “soft” spherical Sherrington-Kirkpatrick (SSSK) 
model and evaluated the almost sure limit of the free energy explicitly. The limit of the free energy 
for the SSSK model also shows a third order phase transition. 

In this paper, we obtain the limit theorem for the fluctuations of for the SSK model. We 
first state the result when the disorder random variables are Gaussian. Non-Gaussian case will be 
stated in Theorem 1.2. 


Theorem 1.1. Let Jij, \ < i < j < N, he independent Gaussian random variables with mean 0 
and variance 1, and set Jji = Jij. Let 


Fn = Fn{/3) 




_ §_ 

e Vn ^-^^3 




( 1 . 6 ) 


he the free energy of the SSK model at inverse temperature /?, where dcjAr is the normalized uniform 
measure on the sphere Sn-i = {cr e : ||cr|p = N}. Then the following holds as N ^ oo. Here 
F{(3) is defined in (1.5), and all the convergences are in distribution. 

(i) In the high temperature regime 0 < /3 < 

N(Fjv-F(/3})^M(f,a}, (1.7) 


where 

f = \ log(l - 4 / 32 ) - 2 / 32 , a = -\ log(l - 4/32) - 2(1^ (1.8) 

(a) In the low temperature regime /3 > ^, 

-\n^/^Fn-F{/3))^TWu (1.9) 

p 2 

where TWi is the GOE Tracy-Widom distribution. 

Hence the order of the fluctuations changes from N~^ in the high temperature regime to 
in the low temperature regime. In the high temperature regime, the fluctuations are asymptotically 
Gaussian with the same variance as in SK model (see (1.4)). On the other hand, in the low 
temperature regime, the fluctuations are asymptotically same as those of the largest eigenvalue of 
a large random matrix from GOE (Gaussian orthogonal ensemble). The connection to the GOE 
Tracy-Widom distribution is apparent at zero temperature: Erom (1.6), we may define the free 
energy at zero temperature (which is the formal limit of ^FAr(/3) as /3 —)■ 00 ) as 

Fn{oo)= sup ^(<T, ^cr) = Amax(3V) (1-10) 

||o-||2=Ar3V y/N 

where Aniax(3V) is the largest eigenvalue of the random symmetric matrix M = —J/^/N, J = 
{Jij)fj=v The random matrix M is almost exactly an x GOE matrix except that the diagonal 
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terms are zero. The fluctuations of the smallest and the largest eigenvalues are still given by the 
GOE Tracy-Widom distribution in the limit N ^ oo [45]: N‘^^^{Fn{oo) — 2) ^ TWi. The theorem 
above shows that the same limit law for the fluctuations hold for all /3 > ^ after the change by 
the multiplicative factor (3 — The proof of the theorem will show that for /3 > the main 
contribution to TV comes from the largest eigenvalue of — J and this holds not only for the leading 
asymptotic term of TV but also for the second term corresponding to the fluctuations. On the 
other hand, we will see in the proof that for /3 < ^, all of the eigenvalues of the random matrix 
—J/y/N contribute to Fjv in the form of the linear statistic g{\) where A, are the eigenvalues 
of —J/y/N for a specific function g. It is a well-known result in random matrix theory that if the 
function g is smooth in an open interval that contains the support of the limiting density function 
of the eigenvalues, then the linear statistic converges to the Gaussian distribution [30, 9, 7, 33]. 

For the SK model, there is no analytic result for the fluctuations in the low temperature regime. 
See [40] for some physical analysis and conjectures. Some numerical studies [4, 12 ] suggest that 
at zero temperature, the order of the fluctuations are smaller that ([ 12 ] suggests 

and the limiting distribution is not the GOE Tracy-Widom distribution. See [20] (also [21]) for a 
mathematical result on the upper bound of the order of the fluctuations. 

It is interesting to consider the near critical case when (3 depends on N and satisfies /3 ^ ^ as 
N ^ oo. Even though we do not show in this paper, it is possible to improve the proof in this 
paper to show that (1.9) still holds when f3 = (3^ = ^ + with 5 < |. It is expected that (1.7) 
also holds when (3 = (3^ = \ — with 5 < 5 . These will be discussed in a future paper. It is 
tempting to predict the critical window of (3 in which the transition from the Gaussian distribution 
to the GOE Tracy-Widom distribution occurs by using Theorem 1.1. The theorem indicates that 
the leading-order term of the variance of TV is of order log(l — 2/3) for /3 < 1/2 and of order 
j^j^{l3 — ^)^ for j3 > 1/2, as N —)• 00 and /3 —By matching these orders, we are lead to 

speculate that the critical window of the temperature is /3 = | -|- Q( ^°vi^ ')- 

Another model in which the Tracy-Widom distribution appears is the directed polymer in 
random environment (DPRE) in 1 -|- 1 dimension. Recent impressive developments in the held 
show that for some specific choices of disorders, the fluctuations of the free energy are given by the 
GUE Tracy-Widom distribution for all /3 > 0 [3, 13, 14, 23, 36]. (Here GUE stands for Gaussian 
unitary ensemble.) It was indeed shown previously in [18, 22] that the critical temperature for 1-1-1 
dimensional DPRE is /3c = 0. Recently it was shown by Alberts, Khanin, and Quastel [2] that the 
critical window is /3 = More specifically if /3 = then the fluctuations of the 

free energy converge to a different distribution parametrized by B, called the crossover distribution 
that appears in the KPZ equation (see also [35]). This regime is called the intermediate disorder 
regime in [ 2 ]. 

For non-Gaussian disorder random variables, we have the following universality result. Note 
that the disordered random variables are not necessarily identically distributed. 

Theorem 1.2. Let Jij, 1 < * < j < iV, he independent random variables satisfying the following 
conditions: 

• All moments of Jij are finite and IE[Jjj] = 0 for all 1 < i < j < N. 

• For all i < j, E[Jj?] = 1, E[[ J^j^] = W 3 , and E[jA] = W 4 for some constants IT 3 , W 4 > 0. 

• For all i = 1, ■ ■ ■ ,N, E[J/J = W 2 for a constant W 2 > 0. 
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Set Jji = Jij for i < j. Define the free energy as (1.6) with the sum replaced by Ylfj=i 
Then (1.7) still holds after the changes 


/ = i log(l - 4 / 3 ^) + - 2) + 2l5\Wi - 3), a = log(l - + fi\w 2 - 2) + - 3). 

( 1 . 11 ) 


On the other hand, (1.9) holds without any changes. 


The starting point of the proofs of the above theorems is a simple integral formula of the 
partition function. 

Lemma 1.3. Let M be an N x N symmetric matrix with eigenvalues Ai > • • • > Xjsf. Then 

p ^7+icxD 1 

/ ef^^^’^'^Uu;N{cT) = CN eTG(-)d^, G(z) = 2/3z - - ^ log(z - A,), (1.12) 

JSn-i J-f -ioo • 


where 7 is any constant satisfying 7 > Ai, the integration contour is the vertical line from 7 — ioo 
to 7 + ioo, the log function is defined in the principal branch, and 


T{N/2) 

27ri(A^/3)^/2-i ■ 


(1.13) 


Here r( 2 :) denotes the Gamma function. 


We may apply the method of steepest-descent to analyze the asymptotic behavior of the above 
integral as N ^ 00 . The difficulty is that G(z) is random since M = —J/y/N is a random 
matrix. Now random matrix theory tells us that the eigenvalues of random symmetric matrix M 
have strong repulsions between them and as a consequence they are rigid in the sense that the 
eigenvalues are close to the deterministic locations determined by the quantiles of their limiting 
empirical distribution (i.e. semi-circle law). This rigidity of the eigenvalues allows us still to be able 
to apply the method of steepest-descent. The crucial technical ingredient here is precise estimate 
on the rigidity of the eigenvalues that was obtained recently by Erdos, Yin, and Yau [27]. This 
precise rigidity estimate is one of the central achievements of the recent surge of advancements of 
our understanding of random matrices. After we obtained the above integral representation ( 1 . 12 ), 
analyzed them asymptotically, and obtained the results in this paper, we learned that the same 
integral representation was already obtained in the paper of Kosterlitz, Thouless, and Jones [31] 
in which they obtained the leading order term of the asymptotics by using the method of steepest- 
descent but without supplying rigorous estimates. Our analysis makes their work rigorous, and 
goes a step further and obtains the second asymptotic term giving the law of the fluctuations. The 
paper [31] also considered the case when the mean of the disorder is not necessarily zero. We plan 
to study this case in a separate paper by the same method as in this paper. A similar formula to 
the above Lemma also appeared in [34] for the analysis of rank 1 real Wishart spiked model. 

Since our analysis only relies on the above integral formula and the rigidity of the eigenvalues, 
the random matrix, M, corresponding to the disorder random variables does not necessarily have 
to have independent entries (hence corresponding to the real Wigner matrices). We indeed obtain 
similar results for M from orthogonal invariant ensembles or real sample covariance matrices, and 
for M from complex Hermitian matrices, since the rigidity of the eigenvalues was proved for a wide 
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variety of random matrices. In the next section, we state general results assuming some spectral 
properties of random matrices, and in the subsequent section, we list a few of random matrices, 
including the one corresponding to the SSK model, for which the results may apply. 

The rest of paper is organized as follows. In Section 2, we introduce general conditions and state 
general results. In Section 3, we illustrate the examples of random matrix ensembles that satisfy 
the general conditions. Theorems 1.1 and 1.2 follow from one of these examples. In Section 4, we 
prove Lemma 1.3. Sections 5 and 6 are the main technical part of this paper in which we analyze 
the integral representation in Lemma 1.3 asymptotically by using the method of steepest-descent. 
The high temperature regime is analyzed in Section 5 and the low temperature regime is analyzed 
in Section 6. In Section 7, we prove Theorem 2.9, on the third order phase transition of the free 
energy. Some technical details in Section 3 are collected in the Appendix. 

Remark 1.4 (Notational Remark 1). Throughout the paper we use C or c in order to denote a 
constant that is independent of N. Even if the constant is different from one place to another, we 
may use the same notation C or c as long as it does not depend on N for the convenience of the 
presentation. 

Remark 1.5 (Notational Remark 2). The notation ^ denotes the convergence in distribution as 
N ^ QO. 

We close this section by introducing the following terminology. 

Definition 1.6 (High probability event). We say that an Wdependent event Qj\f holds with high 
probability if, for any given D > 0, there exists Nq > 0 such that 

< N-^ 


for any N > Nq. 
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2 General results 


2.1 Definitions and conditions 

Definition 2.1. For an N x N real random symmetric matrix M = we define the 

partition function at inverse temperature /3 > 0 by 


Zn = Zn{I3)= [ e^<'"’'^'">da;Ar(<T), 

J Sn-1 


N 

{a,Ma) = ^ Mijataj, 


( 2 . 1 ) 
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where dw^v is the normalized uniform measure on the sphere S^-i = {cr E : ||<t|P = A^}. The 
free energy Fn is dehned by 

Fn = F]y{/3) = — log Zn. (2.2) 

The free energy (1.2) for the SSK model corresponds to the case when M = —J/^/N where J 
is a symmetric random matrix whose diagonal entries are zero and the entries below the diagonal 
are independent and identical random variables of mean 0 and variance 1 . As mentioned in the 
previous section, we prove the limit theorem for the fluctuations for more general random symmetric 
matrices. Precise conditions on M in terms of its eigenvalues will be stated shortly below and these 
conditions are shown to be satished for Wigner matrices, invariant ensembles, and sample covariance 
matrices in Section 3. 

We also consider Hermitian matrices. 

Definition 2.2. For an N x N complex random Hermitian matrix M, we define 

N 

Zn = Zn{I3) = / e^^'^’^'^Ma;Ar(<T), {a,Ma) = 'V MijOlaj, (2.3) 

where CSn-i = {cr € : ||cr|p = N} ~ S 2 N -1 and a;Ar(cr) is the uniform measure on 

The free energy is defined by the same formula (2.2). 

For a symmetric or Hermitian matrix M, let Ai > A 2 > • • • > Aat denote the eigenvalues of 
M. We now list four conditions for the eigenvalues of random matrix M under which the general 
theorems are proved. 

Let vn := ^ denote the empirical spectral measure of M. We assume that con¬ 

verges weakly to a probability measure u. Our first condition is the regularity of the limiting 
spectral measure u in the following sense. 

Condition 2.3 (Regularity of measure). Suppose that the empirical spectral measure vn of M 
converges weakly to a probability measure v that satisfies the following properties: 

• n is supported on an interval [C'_,C'+] and is positive on (C'_,C'+). 

• u is absolutely continuous and ^ exhibits square root decay at the upper edge, i.e., 

^(x) = -I-0(C'+- x)) as XC+ (2.4) 

for some Su > 0 . 


The second condition concerns the rigidity of the eigenvalues. This is the key assumption. 

Condition 2.4 (Rigidity of eigenvalues). For a positive integer A: E [1, W], let k := min{A:, N + 1 — 
k}. Let be the classical location defined by 


Assume that for any e > 0 





holds for all k with high probability. 


(2.5) 

( 2 . 6 ) 
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We remark that under the assumption (2.4), the classical location 7 ^ satishes the estimate 

(7-i^2/3jy-2/3 < |C+ - 7 fc| < CA:2/3iV-2/3 (2.7) 

for some constant C > 1 that is independent of N. 

The third condition is about the linear statistics of the eigenvalues and it is used in the analysis 
of the high temperature case /3 < (5c- 

Condition 2.5 (Linear statistics of the eigenvalues). Assume that for every function y? : R —)• M 
that is analytic in an open neighborhood o/[C'_,C'+] and has compact support, the random variable 

:= ^ ip{\i) - N ip{\)du{X) (2.8) 

i 

converges in distribution to a Gaussian random variable. The mean and the variance of this Gaus¬ 
sian random variable are denoted by M{(p) and V{(p), respectively, and they depend only on ip 
restricted on the support of v. 

The fourth condition is the convergence to the Tracy-Widom distribution of the largest eigen¬ 
value. This will be used in the analysis of the supercritical case (3 > (3c- 

Condition 2.6 (Tracy-Widom limit of the largest eigenvalue). Let Sy be the constant appearing 
in (2.4). Assume that the rescaled largest eigenvalue (sj/ 7 r)“^/^A^^/^(Ai — (7+) converges in distri¬ 
bution to the (GOE or GUE) Tracy-Widom distribution (depending on whether the matrices are 
symmetric or Hermitian). 

We denote by TWi and TW 2 GOE and GUE Tracy-Widom random variables and by Fi and 
F 2 their cumulative distribution functions, respectively. 

Remark 2.7. For invariant ensembles, the support of the measure i' in Condition 2.3 may consist 
of several disjoint intervals. In this case, Condition 2.5 does not hold in general; the variance of the 
linear statistics is a quasi-periodic function in N and does not converge. See [41]. On the other 
hand. Condition 2.6 is known to hold for multi-inverval cases as well. In this paper we choose to 
simplify the situation by assuming that the support consists of a single interval; one can still prove 
some parts of the theorems in the next subsection without assuming this condition. 


2.2 Results for symmetric matrices 

We first dehne the critical inverse temperature. 


Definition 2.8 (Critical (3). Assume that Condition 2.3 holds. Define 


(3:^'^ = (3c 


1 r^+ dujx) 

2 J(^_ ( 7 + — X 


(2.9) 


Note that (3c is finite due to the square root decay of ^ at the upper edge. The exact values of 
(3c can be evaluated for some random matrix ensembles (see Section 3) as follows. 


• /^c = 5 for Wigner matrices (with the choice C+ = 2): SSK model belongs to this example. 



• f3c = \Q'{Cj^) for invariant ensembles with potential Q. 

• for sample covariance matrices. 

The hrst main result is the following. 

Theorem 2.9 (Third order phase transition). Consider an ensemble of symmetric matrices satis¬ 
fying Conditions 2.3, 2.f, 2.5, and 2.6. There is a function F : (0,oo) —)• [0, oo) such that the free 
energy Fn satisfies 

Fn{/ 3) => FiP) ( 2 . 10 ) 

in distribution as N ^ oo, for jd ^ j3c. The function T(/3) is but its third derivative d^F{l3) is 
discontinuous at fd = fdc- This function is defined explicitly in Definition 2.13 below. 

The next two results are about the fluctuations of Fn. 

Theorem 2.10 (high temperature case). Consider an ensemble of symmetric random matrices 
satisfying Conditions 2.3, 2.4, and 2.5. Then for /d < fie, 

N{FM{fi)-F{fi))^M{fia^) ( 2 . 11 ) 

where the constants i = £{fi) and = cr'^ifi) are defined in Definition 2.13 below. 

Theorem 2.11 (low temperature case). Consider an ensemble of symmetric matrices satisfying 
Conditions 2.3, 2.4, and 2.6. Then for fi > fic, 

(2.12) 

Here the constant Sy is the one in Condition 2.3. 

Remark 2.12. For a given random symmetric matrix M, defines a probability measure 

on the sphere 5^-1. It is interesting to study how far a random point cr on Sn-i under this 
probability measure is from the eigenspace for the largest eigenvalue of the matrix. One such a 
measurement is the random variable 

1 r phH{cr) 

Em [|(cr,ui)|2] = — |(cr,ui)|2^—du;7v(cr) (2.13) 

where Vi is an ^ 2 -iiormahzed eigenvector associated with Ai. Note that due to the absolute value 
in |(cr,ui)l, (2.13) does not depend on the choice of the eigenvector vi. It is easy to check 
that Em [|(<t,ui)P] = and, from this formula, it is straightforward to prove that 

Em [I= 0{N~^) for fi < fic hy modifying the analysis in Section 5 for the proof of 
Theorem 2.10. This is consistent with the fact that for the high temperature case, all eigenvalues 
contribute to the free energy and the fluctuations of F)v come from the fluctuations of certain linear 
statistics of all of the eigenvalues. On the other hand, for the super-critical case when fi > fic, we 
expect that Em converges to a constant that depends on fi. 
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The constants appearing in the above theorems are given as follows. Note that h{s) := 
1 Ic^ ^ decreasing function for real s > (7+ and h{s) —)• 0 as s ^ +oo. Moreover, by the 

definition 2.8 of f3c, h{s) ^ /3c as s \ (7+. Hence for (3 < /3c, there is a unique 7 = 7 (/ 3 ) € ((7+, 00 ) 
that satisfies 

2Jc7_ 7-x 

Note that 7 (/ 3 ) is an decreasing function in /3 € (0, /3c) and 7 (/ 3 ) \ (7+ as (3 f3c- 

Definition 2.13. Define 


and 


F(/3) = /37(/3) - 


F(/3) = /3C+ - 


1 


1 


r.C+ 


'C- 


log( 7 (/ 3 ) - k)dv{k) + 1 + log(2/3) 


r-C+ 


'C- 


log(C'+ - k)<iv{k) + 1 + log(2/3) 


Furthermore, for /3 < /3c, define 


where 




£i(/3) = log(2/3) - ^ log ^ 


/c_ (7(/3)-A:)2 
and M{(p) and V{^p) are defined in Condition 2.5 with 

Lp{x) = log( 7 (/ 3 ) - x). 


/3 < 13c, 


I3 > /3c. 


(2.15) 

(2.16) 

(2.17) 

(2.18) 

(2.19) 


2.3 Results for Hermitian matrices 

All of the previous results hold for Hermitian matrices after the following simple changes: 

1. The critical value in Definition 2.8 is changed to (3^ = 2/3c^™'. 

2. Theorems 2.9 and 2.10 hold without any changes. 

3. Theorem 2.11 holds with TWi replaced by TW 2 in (2.12). 

4. In Definition 2.13, all terms remain the same after the change that /3 is replaced by (3/2. For 
example, F^{f3) = F{(3/2). 


3 Examples 

We list some random matrix ensembles, which satisfy the conditions in Subsection 2.1, and hence 
to which the general results in Subsections 2.2 and 2.3 apply. 
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3.1 Wigner matrix 

A real Wigner matrix is an x real symmetric matrix M whose upper triangle entries 
^ j) are independent real random variables satisfying the following conditions: 

• The entries are centered, i.e., = 0 for all i,j. 

• Their variances satisfy that E[|Mjjp] = ^ for i 7 ^ j and E[|Mjjp] = ^ for a constant W 2 > 0. 

• For any integer p > 2, E[|Mjj|^] = Moreover, E[|Mjjp] and E[|Mjj|^] do not 

depend on i^j. 

A complex Wigner matrix is an A^ x A^ complex Hermtian matrix M whose real and imaginary 
parts of the entries are all independent, modulo the Hermitian condition, and satisfy the same 
moments conditions as above and an extra condition that E[(Mjj)^] = 0 for i 7 ^ j. 

Remark 3.1. We note that some of the Conditions in Subsection 2.1 are still satisfied even if some 
of the conditions on the definition of Wigner matrices, such as the existence of all moments, are 
relaxed. However, we content with the above definition of Wigner matrices so that all of the four 
Conditions in the previous section are simultaneously satisfied. Similar remark also applies to the 
random matrix ensembles in the next two subsections. 

For real and complex Wigner matrices, the following are known: 

1. Condition 2.3 (Regularity) 

For both real and complex case, the limiting spectral measure is given by the semicircle law, 

= 7^\/4 — —2 < X < 2. (3.1) 

ax 27r 

Hence Condition 2.3 is satisfied with (7+ = 2 and Su = ^. 

2. Condition 2.4 (Rigidity) 

Condition 2.4 was proved in [27] for both real and complex cases. 

3. Condition 2.5 (Linear statistics) 

Condition 2.5 was proved in [9]. See also [7] and [33] for non-analytic test functions. The 
mean M[ip) and the variance V((p) for function cp are as follows. Let 

W 2 := NE[\Mii\‘^], W 4 := N‘^E[\Mi 2 \'^]. (3.2) 

We have W 2 = 2, W 4 = 3 for GOE, and W 2 = 1, W 4 = 2 for Gaussian unitary ensemble (GUE). 
Set ^ 

Ti{tp) = — [ (p{x)—^^J=J=dx = — [ (p{2 cos 6 ) cos{£ 6 )d 6 (3.3) 

TT J-2 V 4 - x2 27r 

for £ = 0,1, 2, • • • , where Ti{t) are the Chebyshev polynomials of the first kind; To(t) = 1, 
Ti{t) = t, T 2 {t) = 2t‘^ — 1, T 3 {t) = — 3t, T 4 {t) = 8 t^ — + 1, etc. 

The mean and the variance for the real case are 

M{ip) = Mcoe{<r) + T 2 {<p)[w 2 - 2 ) + T4{p>){W4 - 3) (3.4) 
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and 

V{^p) = Vgoe{^) + Ti{ipf{w2 - 2) + 2t2((^)^(W4 - 3), 

respectively, where 

Mgoe{v) = ^ [¥^(2) + <^(-2)] - 

and 


yGOE{^) = 2 

i=i 

27r2 J _2 J _2 V Ai — A 2 


4 — A 1 A 2 
V^4 — Af y^4 — A 2 


dAidA2- 


For the complex case, 

M^{ip) = Mgue{^) + T2{^){w2 - 1) + Ti{ip){W4^ - 2) 

and 

v^{tp) = Vgue{v>) + ti{^)^{w2 - 1) + 2r2((/?)^(W4 - 2), 

respectively, where 

Mgue{^) = 0 , VGUEi^) = yGOE{^)l‘^- 


(3.5) 

(3.6) 


(3.7) 


(3.8) 

(3.9) 
(3.10) 


4. Condition 2.6 (Tracy-Widom limit) 

The Tracy-Widom distribution limit of the largest eigenvalue was proved in [45, 51, 27] for 
both real and complex cases. 


We can evaluate the various constants appearing in the theorems in Subsection 2.2 explicitly 
and obtain following for real Wigner matrices. 


(i) Pc = i 

(ii) The limit of the free energy is 


m 


/32 if 0< (3 <1/2 

2/j - ^°sm+3/2 if ^>1/2. 


(3.11) 


(hi) For P <^, N {Fn{P) - T(/3)) ^ Af{i, a^) where 

£ = ^ (log(l - 4/3^) + 4/3^(w2 - 2) + 8/3\W4 - 3)) (3.12) 

and 

= i (- log(l - 4f) + 2f(w2 - 2) + 4/3^(W4 - 3)) . (3.13) 

(iv) For /3 > i, (/3 - i)-iiV2/3 (Fjv(P) - F(/3)) => TWg 


12 







See Appendix A.l for the detail. This proves Theorem 1.1 and Theorem 1.2. 


For complex Wigner matrices, we have the following changes: (i) j3^ 
L(/3/2). For (hi), ^ 

I = i (^log(l - /32) + ^\w 2 - 1) + - 2)) 

and 

«4 

cr^ = - log(l - 0^) + 0{W2 - 1) + - 2)- 

for /3 < 1. For (iv), /3 is replaced by /3/2 and TWi by TW 2 - 


1 and (ii) T^(/3) = 

(3.14) 

(3.15) 


3.2 Invariant ensemble 

The orthogonal invariant ensemble associated with potential Q : M —>• M is dehned by the density 

lP(M) = iexp(-yTrQ(M)) (3.16) 

on the space of x real symmetric matrices and Z is the normalization constant. Similarly, the 
unitary invariant ensemble associated with potential Q : M —)• M is defined by the density 

V^{M) = ^eM-NTTQ{M)) (3.17) 

on the space of complex Hermitian matrices, where is the normalization constant. The GOE 
and the GUE correspond to the choice Q{x) = x^/2. We assume that Q is a polynomial of even 
degree with positive leading coefficient. Many of the results below hold true for more general Q 
but we restrict to this class of Q for the convenience of the presentation. Furthermore, we assume 
that the associated equilibrium measure v is of form 

dv{x) = l[c_,c+\{x)Kx)\/{C+ - x){x - C'_)dx (3.18) 

for constants > C- and for a function h{x) that is real analytic and positive in an open set 
containing the interval [C_,C'+]. Recall that for general Q, the support of v may consist of several 
intervals. Here we make the single interval assumption in order to use the central limit theorem 
for linear statistics. On the other hand, the square root behavior at the end points of the support 
holds for generic Q [32]. We remark that (3.18) is guaranteed if Q is convex. 

The following are known: 


1. Condition 2.3 (Regularity) 


This holds from the assumption (3.18) since the limiting spectral measure is given by the 
equilibrium measure. We remark that, from the variational condition on v, the following 
relation holds: 



di'{x) 

C+-x' 


(3.19) 


2. Condition 2.4 (Rigidity) 

Condition 2.4 is proved in [16] (see also [15, 17]). 
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3. Condition 2.5 (Linear statistics) 

The linear statistics of eigenvalues was proved in [30]. The mean M (cp) and the variance V{(p) 
for function (/? are as follows. For orthogonal invariant ensemble, we have 

Mo.(.) = i (.(C_) + .(CJ, - £ (3.20, 

for some rational function Uq which depends on Q. The formula of Uq is complicated and is 
given in (3.54) of [30]. It is, in particular, given by Uq{x) = ^ when Q{x) = On the 
other hand, 

Voe{^) = VcoEm, Hx) = ^ + , (3.21) 

where Vqoe is the variance (3.7) for the GOE case. The change from to ^ comes from 
the simple translation of the interval (C'_,C'+) to (—2,2). Observe that the variance Voe{^) 
does not depend on the structure of the equilibrium measure except for the end points C- 
and (7+.On the contrary, the mean MoEi^) depends on the full structure of the equilibrium 
measure. 

For unitary invariant ensemble, we have 

MuEi'^) = 0, VuEif) = VoEif)!"^- (3.22) 

We remark that Condition 2.5 is not always valid if suppz^ consists of multiple disjoint inter¬ 
vals. See [41] for more detail. 

4. Condition 2.6 (Tracy-Widom limit) 

Condition 2.6 was proved in [42, 11, 26, 25, 16]. 


We can easily check from (3.19) that the critical value is 

11 ?’^ = 1 = 1(3'(c+), /3™ = lg'(c+) (3.23) 

for orthogonal ensembles and unitary ensembles, respectively. Other constants appearing in the 
main theorems of this paper in the previous section may be evaluated for a given potential once 
the equilibrium measure is obtained. 

3.3 Sample covariance matrix 

Let X he a K X N matrix whose entries are independent real random variables satisfying the 
following conditions: 

• The entries are centered, i.e., ]E[Wj] = 0. 

• Their variances satisfy that E[|Xjjj^] = 
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• For any integer p > 2, E[|Xjj|P] = 0{N p/^). Moreover, E[|Xjjp] and E^Xyl"^] do not depend 
on i,j. 

A sample covariance matrix M is a random matrix of the form M = X*X. When X is a complex 
matrix, then we assume, in addition, that E[(Xjj)^] = 0. We assume further that K = K{N) with 

— —)■ d G [1, oo) (3.24) 


as —>• oo. 

The following are known. 

1. Condition 2.3 (Regularity) 

The limiting spectral measure is the Marchenko-Pastur distribution given by 


j ^ ^ 1 \/(C'+ -x){x-C-) 

ai'{x) = —--dx, 


27r 


(3.25) 


with support [C., (7+] = [{s/d - 1)^, {s/d + 1)^]. Hence ■ 


2. Condition 2.4 (Rigidity) 

Condition 2.4 was proved in [44]. 

3. Condition 2.5 (Linear statistics) 

Condition 2.5 was proved in [6]. For non-analytic test functions, see [8] and [33]. The mean 
AI{(p) and the variance V{ip) for function (f are as follows: see (1.3)-(1.5) in [8], (5.13) of [6], 
and (4.28) of [33]. Let kF 4 = N^E[\Xn\^]. Set 


, ,C- + C+ C+-C- 

^(a:) = ^ + 


= ip{d + 1 + xVd). 


(3.26) 


2 ' 4 

For real sample covariance matrix, 

AI{ip) = MGOE{^)-{Wi-3)T2{^), V{ip) = VGOE{^) + {Wi-3)Ti{^)^ (3.27) 

For complex sample covariance matrix, 

M“"^p(^) = -(W4-2)t2(4>), R“”*p(^) = RG[/E($) + (W4-2)ri(4>)2. (3.28) 

4. Condition 2.6 (Tracy-Widom limit) 

Condition 2.6 was proved in [46, 43, 52, 44]. 

Various constants can be evaluated explicitly and we obtain the following. See Appendix A.2 
for the detail. 


(i) Pc = 


2y/c^ 2(-s/d+l) ■ 
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(ii) The limit of the free energy per particle is 


m = 


-^log(l - 2/3), 


/ 3 < 


(1 + ^ (^(1 + ^/d) + dlog - log + log(2/3)) , /3 > 


2{Vd+l) ’ 

1 

2(\/d+l) 

(3.29) 


(hi) For /3 < /3c, N{Fn{I 3) — F{f3)) M{i,cr‘^) where where L is given in (3.29) and 


i = — {logil-iB^)-AB^{W^-3)), a^ = 


1 

Ip 

where we set 


1 


B = 


2/3' 

pVd 


2 (- log(l - AB^) + B\W^ - 3)) (3.30) 


1-2/3 

Note that i3 < 1/2 for /3 < /3c and i3 > 1/2 for /3 > Pc- 
(iv) For P > Pc, - Pc)-^N‘^/^ {FM - FiP)) => TW^. 


(3.31) 


For complex sample covariance matrices, P^ = (P) = L{P/2), and 

i = ^ (log(l - 4B^) - AB\W^ - 2)) , ^ (- log(l - AB^) + AB^W^ - 2)) . (3.32) 

4 Integral representation of the partition Function 

Our starting point in the analysis is the integral representation of the free energy given in Lemma 
1.3. We prove it here. As mentioned in Introduction, this formula was also obtained in [31], and a 
similar formula appeared in [34]. 

Proof of Lemma 1.3. Let = {cc S : [[a;]] = 1}, the unit sphere in , and let dll be the 

surface area measure on Hence is the uniform measure on We denote the 

left-hand side of (1.12) by Zj\f as in (2.1). By change of variables, 

- isiri 

We diagonalize M and let M = DO for an orthogonal matrix O and a diagonal matrix D = 
diag(Ai, A 2 , • • • , Xn)- Since {x, Mx) = {Ox, DOx) and O is orthogonal, we find after the changes 
of variables x 1 —)• 0~^x that 

L- L_. ('‘•2) 

In order to evaluate the integral, we consider 

J{z) := [ dy, z > Xi. (4.3) 
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We evaluate J{z) the above integral in two different ways. First we evaluate it directly using 
Gaussian integral and second, we use polar coordinates. By evaluating the Gaussian integrals, we 
obtain 

/ \ ^/2 1 


y/z - \i 


On the other hand, by using polar coordinates, we substitute y = rx, r > 0, with ||a;|| = 1 in (4.3), 
and then set t to find that 


J{z) = 


1 




(4.5) 


Note that J{z) is, up to a constant factor, the Laplace transform of Taking inverse 

Laplace transform and using (4.4), we obtain 


^7V/2-l/(i) ^ 1 r7+ioo 

2{(3N)^/^ ~ ^ 


e^^J{z)dz = 


7 — 100 


(/3iv) 


N/2 


■y /*7H-ioo 


,zt 


2txi 


’ 7 — 100 


n 


Vz - A, 


:dz 


(4.6) 


where 7 is an arbitrary real number satisfying 7 > Ai since J{z) is dehned for z > Ai. Since 
= |gjvLi| I {(3N), we obtain the desired lemma by setting t = jdN and recalling that |gg_i| = 
r(jv/ 2 ) |-| 

27r^/2 • 


From Lemma 1.3, the partition function (2.1) satisfies 

^ 7 +ioo 


Zm = Cn r e^GGldz, 

J 7 —ioo 


Cn = 


V{N/2) 


27ri(iV,0)^/2-i ’ 


where 7 > Ai, and 


G{z) = 2/32; - ^ ^ log( 2 ; - A,). 


(4.7) 


(4.8) 


We use the method of steepest-descent to this integral. The following lemma shows that there is a 
critical value of G[z) on the part of the real line z G (Ai, cx)) and we choose 7 as this critical point. 

Lemma 4.1. There exists a unique 7 G (Ai,(X)) satisfying the equation Gf'y) = 0. 

Proof. This is immediately obtained by noting that 


0'<^)=2/5-4E ' 


N ^ z-Xi 

I 

is an increasing function of 2 ; G M on the interval (Ai,oo) with 

lim G'( 2 :) = — 00 , lim G'{z) = 2/3 > 0, 

2 \ Ai 2-).00 


(4.9) 


□ 
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We also remark that 





> 0 , for z > \\. 


(4.10) 


Hence z = 7 is a saddle point of the real part of the function G{z), and Re(G( 2 )) decays fastest 
along the vertical line z = 7 + iy as |y| increases for small y. 

Remark 4.2. The analogue of Lemma 1.3 for Hermitian matrices is the following. For a Hermitian 
matrix M with eigenvalues Ai > • • • > Aat, we have 



where 


m) 

— r, •/ Aro\/ 


27ri(A^^)^-i ■ 


(4.12) 


This can be obtained similarly. By diagonalizing M and changing variables, the partition function 


IS 



This is evaluated by considering 

Jh{z):= [ 

5 High temperature case 

In this section, we consider the case (5 < and prove Theorem 2.10 for symmetric ensembles; the 
proof for Hermitian ensembles can be done in a similar manner by using (4.11) and we skip its 
proof. 

We use the method of steepest-descent in order to evaluate the integral (4.7) asymptotically. 
Since G{z) is random (since \i are random), the critical point 7 from Lemma 4.1 is a random 
variable. We approximate 7 by a non-random number 7 , which is the critical point in the interval 
((7+, 00 ) of the function 



(5.1) 


The function G{z) is the version of G{z) in which the random spectral measure is replaced by the 


limiting non-random spectral measure. From the discussions around (2.14), if /3 G (0,/3c), there is 
unique 7 satisfying 




(5.2) 


We start with the following lemma, which holds for all /3 > 0. 

Lemma 5.1. Assume Condition 2.3 and Condition 2.4- Fix 5 > 0. Then the following hold. 
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(i) For every e > 0 


(5.3) 


G'{z) - G'{z) = 0(iV-^+") 
uniformly in z > (7+ + 5 with high probability. 

(a) For each I = 0 ,the derivative G^^\z) = 0(1) uniformly in z E C\Bs with high 
probability where Bs = {x + iy : G- — 6 < x < 0 + + 6, —6 < y < 6}. 


Proof. For (i), set 

G{z) = 2/32: - ^ ^ log( 2 ; - 7 *) (5.4) 

i 

where 7 * is the classical location of the i-th eigenvalue as defined in (2.5). Then from the rigidity, 
Condition 2.4, 


G'{z)-G'{z) 


(Ai - 7i) 


ly 

iVy {z - Xi){z --fi) 


< — y < 


N 


N 


uniformly in 2 ; > 0+ + 5 with high probability. On the other hand, we claim that 

1 diy{k) 

z-li Jc- z-k 


G'{z)-G'iz) 


ly. 

N ^ 


<h 

“ N 


uniformly in 2 ; > C'_|_ + 6. For this, we dehne jj by 

/» ry ^ 

J 


dn{k) = 


nj 


N’ 


j = l,2,' 


,N, 


with 7 o = 0_|_. Note that 7 * < 7 * < 7 i-i. We then have for z = 2,3, • • • ,N — 1 that 


pi 


dv{k) 1 


z — k N z — ji 


< 


pi-2 

dzi-1 


dn{k) 
z — k 


(5.5) 


(5.6) 


(5.7) 


(5.8) 


for 2 ; > 0 + + (5. Summing over i and using the trivial estimates = 0{N ^) and y ^ ^z-k ~ 

0{N~^), we find that the desired claim holds. The estimates (5.5) and (5.6) imply (5.3). 

The part (ii) of the Lemma follows straightforwardly from the formula of G^^'^ ( 2 ;) and the rigidity. 
Condition 2.4. □ 


Corollary 5.2. Assume Condition 2.3 and Condition 2.4- Let (3 < fie. Let'j be the number defined 
in Lemma 4-1 o,nd let 7 be defined in (5.2). Then for every e > 0, 

/V2e 

l 7 - 7 l<-^ (5-9) 

with high probability. In particular, there is a constant c > 0 such that 

7 — Ai > c (5.10) 


with high probability. 
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Proof. Since 7 > C+, choosing 6 G (0, (7 — C'+)/2) in Lemma 5.1, we find that 

G '(7 ± ^ ^ Ar-1+2^) + 0(Ar-i+") (5 

with high probability. Now, since G'{j) = 0 and G'"{z) = 0(1) for 2 near 7 , the Taylor expansion 
of G implies that 

G"(7 ± lV-^+2^) = ±G"(7)1V-^+2^ + 0(lV-2+4e) + 0(lV-^+"). (5.12) 

Noting that G"( 7 ) = ^ shows that 

G'(j-N-^+^^) <0, G '(7 + iV-^+2") > 0 (5.13) 

with high probability. Since G'(z) is an increasing function of z, this proves (5.9). 

The estimate (5.10) is a consequence of (5.9), (2.6), (2.7), and the fact that 7 is a non-random 
number, independent of N, satisfying 7 > G+. □ 

Corollary 5.3. Assume Condition 2.3 and Condition 2.4 and let fd < (3c. Then for every e > 0 

G( 7 ) = G(7) + 0(iV-2+4^), G"( 7 ) = G"( 7 ) + 0(iV-i+2^) (5.14) 

with high probability. 

Proof. Prom the Taylor expansion, the definition of 7 , and Lemma 5.1 (ii), 

<^( 7 ) = 0 ( 7 ) G'( 7)(7 - 7 ) + 0(|7 - 7 p) = G( 7 ) -h 0(|7 - 7 ]^) (5.15) 


and 

G"(7) = G"(7) + 0(|7-7l). (5.16) 

The estimate (5.14) now follows from Corollary 5.2. □ 

We now evaluate the integral (4.7) using the method of steepest-descent. 

Lemma 5.4. Assume Condition 2.3 and Condition 2.4 and let f3 < j3c. Then for every e > 0, 


(*7+ioo 


'7 —100 




dvr 


NG"( 7 ) 


(1 + 0 (A^-^+®")) 


(5.17) 


with high probability. 


Proof. We had chosen 7 as the critical point of G{z) such that 7 > Ai. For this proof, it is enough 
to use the straight line 7 -|- iM for the contour instead of the path of steepest-descent. Changing 
the variables, we have 


(*7H-icx) 


/ 7 —ICXD 


VNJ-OO 

iefG(7) 




dt 


y/N 


f 


exp 


N 




(5.18) 


dt. 
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We now estimate the integral in the right hand side of (5.18). First, we have 


fTV' 


exp 


i-m 


N 




dt 


fAT" 


exp 


/-Are 

i-N^ 

l-m 


N fG"{j) (h)2 ^ ^ 0(iV-2+4- 


2 N 


6 iV3/2 


dt 


(5.19) 


exp 




dt — 1 -;= exp 

y_Are 12 Viv 


G"'(7). 


dt + 0(W 


-l+6e\ 


with high probability, where we used that G''{'y) > 0 and G^^\'y + it) = 0(1) for any t G M and 
^ = 3,4 (see Lemma 5.1 (ii)). The integral in the middle vanishes since the integrand is an odd 
function of t. From the estimate e“*^dt = we obtain that 


rN^ 


/-Are 


exp 


N 


-(W + i;7^)-G(7) 


dt = 


' dvr 

G^ 


(1 + 0(iV-^+®")) . 


(5.20) 


Second, the tail part of the integral in the right hand side of (5.18) satisfies 


/■oc 

JN^ 


exp 


N 


t 


y(G(7 + i^)-G(7) 


dt 


< 


poo 

Jn^ 


exp 


^ ^7-A,-+ itA/-i/2' 


■ J^log 


i=i 


poo 

\ /v_ 

^ ct^M 

< / exp 

J 

1 

o' 

crq 

1? 

+ 

1 — 1 


7-Aj 

dt 


dt 


(5.21) 


pN poo 

< / e-t^''dt+ / (cW-it2)-^Odt = 0(e-^') + 0(iV-^/®) 

Jn^ Jn 

with high probability, where we used (5.10). Thus, the tail part is negligible, and hence we obtain 
from (5.18) that 

p-y+ioo 


iefG(7) 4^ 


/y-ioo VN Y G"(7) 


(1 + 0(Af-^+®")) . 


(5.22) 


Finally, using Corollary 5.3, we conclude that 


(*7+icxD 


iefG(7) 4^ 


I e 2 ^^^^dz = - AI - 

/y-ioo VN V G"(7) 


(1 + 0(iV-i+6^)) 


with high probability. 

We now prove Theorem 2.10. 

Proof of Theorem 2.10. From Lemmas 1.3 and 5.4, for every e > 0, 


(5.23) 

□ 


Zpp = iOAre 2 


^G(7) , 


dvr 


iVG"(7) 


— (i + o(iv-i+^)) 


(5.24) 
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with high probability. Since 


Cn = 


T{N/2) 


VNf3 


2'it\{N 13)^1 ^ iy^(2/3e)'^/^ 

from the Stirling’s formula, we find that 


(l + 0 (iV-i)) 


-Ag{3) _ 


Zj^ = e 2 


{1 + 0{N-A) 


(5.25) 


(5.26) 


and hence 


Fn = ^ logZ^ = ^ (G( 7 ) - 1 - log(2/3)) + 1 (^log(2/3) - i logG"(7)) + 0{N-^A (5.27) 


with high probability. 
Define the functions 


(^(x) := log( 7 -x), V’(a^) := 


(5.28) 


(7 - x )2 

for X € [(7+, C-] and extend ip and ijj to bounded C°° functions with compact support on the real 


line. We then have 




N 

I 

with high probability. Regarding 7 as a function of /?, set 

rc+ 


/o(/3) := / \og{^ - k)<iv{k), / 2 (/ 3 ) := / 

Jc- JC- 


di'{k) 

c_ (9^' 


(5.29) 


(5.30) 


We find from Condition 2.5 that 


A ■= A - N f ip{k)du{k) = A - FffoiP) (5.31) 

i i 

converges in distribution to Gaussian random variable with mean M{(p) and variance V((p). Simi¬ 
larly, we also have that 

:= V’(A*) - N Ak)Mk) = A ^ (5-32) 

i i 

converges in distribution to Gaussian random variable with mean M{'ip) and variance V{'ip)- 
Thus, (5.27) becomes 




21og(2ff)-log(/2(g)) _J_ / Nt \ 1^ 

2N 2N ^\ Nf2{/3) J ^ ^ 


(5.33) 


with high probability. Since A/)/, converges to a Gaussian random variable, we see, in particular, 
that the random variable log(l -|- in the right hand side of (5.33) converges in distribution 
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to 0 as —)• oo. Since the convergence in distribution to the constant 0 implies the convergence in 

probability to 0, using Slutsky theorem, we conclude that 

21og(2/3) - log(/ 2 (/ 3 )) 

2 

converges in distribution to a Gaussian with mean and variance This completes 

the proof of the theorem. 

□ 


NFn -N ip j- 


/o(/3) + l + log(2/3) \ 


6 Low temperature case 


In this section, we consider the case P > Pc and prove Theorem 2.11. As in the last section, we only 
give a proof for symmetric random matrices; the proof for Hermitian random matrices is similar. 

As we saw in the previous section, the location of 7 , the critical point of the function G, is 
crucial in the asymptotic evaluation of the integral in (4.7). When P < Pc we approximated 7 by 
the deterministic number 7 that is the critical point of the deterministic approximation G{z) of 
the function G{z). However, when P > /3c, G{z) does not have any critical point in ^; > C_|_ and we 
cannot approximate 7 by a deterministic number. The following lemma shows that 7 is close to Ai 
up to order about 1/N. 

Lemma 6.1. Assume Conditions 2.3 and 2.4- Let P > Pc- Then for every 0 < e < |, 


1 

- < 7 

3pN - ' 


Ai < 


/V4€ 

~ir 


( 6 . 1 ) 


with high probability. 


Proof. Since 

= .>A.. (6.2) 

i 

we see that G'(Ai + 3 ;^) < 0. 

Since G'{z) is an increasing function for z > Ai, the Lemma is proved if we show that G'(Ai + 
jY-i+ 4 e^ > 0 with high probability. By Condition 2.4, we may assume that the eigenvalues Aj’s 
satisfy the rigidity (2.6): this event occurs with high probability. For such Aj’s we need to show 
that G'(Ai + > 0. In order to show this, we write 




N^z-K 

1=1 


1 

iV 


1 


Ar_Ar3e 

? Ai 

i=iV3£ + l 


1 

iv 


N 

E 

i=N-N^^+l 


z - Xi 


(6.3) 


with 2 = Ai + A^ i+4e_ Pqj. 1 < j since Ai > A*, 


1 

N 


iV3e 


E 


1 

a7TaFi+47 


Ai 


0{N-^). 


(6.4) 


For < i < N — N^‘^, we have from the rigidity condition (2.6) that |Ai — 7 ,! < N We also 
note that C+ — ji > from (2.7). On the other hand, since Ai — 71 = 0(77“^/^+'^) 


23 












from (2.6) and C+ — 71 = 0{N from (2.7), we have Ai = (7+ + 0{N Therefore we hnd 


that 


7V-Af3' 

1 V _i_ 

AT Ai + 7V-l+4e 

i=Ar3'=+l 


Ai 


N 


N-N^‘ 

E 

i=Ar3£+l 



(1 + 0(7V-^)). 


(6.5) 


The right hand side can be estimated by applying the idea used in the proof of Lemma 5.1. Recall 
the definition of 7 ^ in (5.7). Summing the inequalities (see (5.8)) 


diy{k) ^ 1 1 ^ n*-^ di^jk) 

4^, C+-A: - iVC+-7i c+-k 

over i from +1 to and recalling that (3c = ^ c^J'-k Definition 2.8, we find that 


N-N^‘ ^ 

V E 


N V (7+ - 7i 

i=Ar3«+l + '* 


^ , C'+ - Jc- 


di'{k) 


< C 


^'T'jV-Ar3e_2 


di^{k) 


+ 


i=Ar3e + l 

dv{k) 


7 o f7-|_ k 

'Ar3e+2 ^ , 


- fc 

Finally, for N — < i < N, since Ai — A* > 1, we find 

1 V _ - _ 

N Ai + iV-i+4^ - Ai 

i=Ar-Af3E + i 


C_ C+-k 
= 0(iV"^/3+"). 


= 0(iV-^+3^). 


Combining the estimates, we find that 

G'(Ai + iV-i+4^) = 2(3- 2(3c + 0 ( 1 ) > 0. 

This proves the lemma. 


(6.7) 


( 6 . 8 ) 


(6.9) 

□ 


We will show in Lemma 6.3 below that the method of steepest-descent still applies and the 
main contribution to the integral representation of comes from G{'y). The value of ^( 7 ) is, 
heuristically, 


G(7) = 2/37 
«2/37 


N 


N 


^log(7 - Ai 


i=l 

N 


-y 

N ^ 


i=l 


log(C+ - Ai) + 




■(7-C+) 


( 6 . 10 ) 


~ 2 / 37 -/ logic+-z)diy{z) - f 

Jc- Jc- C+- z 

rC+ 

« 2/3Ai - / log(C'+ - z)dj^iz) - 2(3ciXi - (7+) 

Jc- 

where we used (2.9) and Lemma 6.1 for the last line. We show this approximation rigorously and 
also estimate the derivatives of Gi'j) in the next Lemma. 
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Lemma 6.2. Assume Conditions 2.3 and 2.4- Let fd > fdc- Let 7 he the solution of the equation 
= 0 in Lemma 6.1. Then for 0 < e < | 

G( 7 ) = 2/3Ai - log(C+ - z)di^iz) - 2/3,(Ai - C+) + 0(Ar-i+4^) 

Jc- 

with high probability. Moreover, for 0 < e < there is a constant Cq > 0 such that 

< C^iV^-i+3^ (6.12) 

[l — ij! 

for all i = 2,3, ■■ ■ with high probability. Here, Cq does not depend on i. 

Proof. We may assume that the eigenvalues Aj’s satisfy the rigidity Condition 2.4 and Lemma 6.1 
since these event occurs with high probability. 

Since we have from Lemma 6.1 that 7 = Ai + part of the lemma is proved 

if we show that 

1 ^ 

- J^log( 7 -Ai)- 

i=l 

Mimicking the proof of Lemma 6.1, we first consider 1 < i < Since 7 — Ai > 3 ^ with high 
probability by Lemma 6.1, we have the trivial estimate 


rC+ 


IC- 


log(C'+ - k)d,zik) - 2 / 3 e (7 - c+) 




(6.13) 


iVS" 


N 


log (7 - Ai 


i=l 


< log N. 


Similarly, 


1 

iV 


N 


Y log (7 - Ai 

i=Ar_iv3£+i 


< 


CiV36 


N 


We now consider the case < i < N — Note that 


log (7 - Ai) - log(C'+ - 7 i) = log ( 1 + Z—— + 


C+ - 7i C+- 7i 


7-g+ 

C+--fi 


+ 0 


1-C+ 

C+-l^ 


+ 0 


Ai - 7i 

C+-l^ 


(6.14) 


(6.15) 


(6.16) 


The first error term can be estimated by 


c+-^i) - i4/3Ar-4/3 




(6.17) 


from (2.7) and 7 — C+ = (7 — Ai) + (Ai — (7+) = 0{N 2/3+e^ ^i^g Lemma 6.1. For the second 
error term in (6.16), we consider two different cases. For <i < Ll/2, we use the estimate 


|Ai - 7 i| ^ i 2/3+e 

C+ - 7 * - i2/3^-2/3 


CNH-^ 


(6.18) 
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from (2.5) and (2.7). For N/2 < i < N — we simply use 


|Ai -7i| 
C+-l^ 


< CN 


-l+e 


Since 


Ar_Ar3£ 


N/2 


^ iV2.--4/3 < ^ ^ 


(6.19) 


( 6 . 20 ) 


i=Ar3e + l 

we obtain, after summing (6.16) over i, that 

Ar_Ar3e ^ N-N^^ 

N 


i=Af3E + l 


1 

iv 


^ log(7 - -^0 - ^ Y - 7i) - ^ 

i=7V3E + l 

From (6.14), (6.15), and (6.21), we conclude that 


Ar_Ar3e 

E 


N V (^+ - 7i 

i=Ar3£+i + '* 


< 


CN^ logiV 


N 


(6.21) 


AT 


Ar_Ar3e 


.... „ N—N^^ 

^ Y log(7 - - 4 Y1 ^ At ^ ^ 




i=l j=Ar3E+i 

We proved in (6.7) that 

N-N^‘ 


N ^ - 7i 

i=7V3.+i + 


= 0(iV"^+^"). (6.22) 


N ^ ^+-7, 

j=Ar3e + l + '* 


= 0(iV"^/3+"). 


(6.23) 


On the other hand, following the arguments in (6.6) and (6.7), we obtain 


1 

N 


Ar_Ar3e 


< c 


i=Ar3e+i 

^'yN-N^C-2 


Y ^°s(C+ - 70 - [ log(C'+ - k)du{k) 

\T?„= I 1 •JC— 

log(C'+ — k)dv{k) + f log(C'+ — k)di'{k) 

^ W I <-> 


(6.24) 


< C'iV-^+3" + c 


L 


_/V“ 2 / 3 + 2 e 


\/k log K dK 


< CN-^+^nogN. 


Inserting these estimates into (6.22), we obtain (6.13). Hence the first part of the lemma is proved. 
We now prove the second part of the lemma. We have 




N 

for £ = 2,3, • • • . For the lower bound, we use Lemma 6.1 to obtain 

N 


(6.25) 


-Y 

^ s (7 - Y “ iV (7 - Ai)< 


1 1 1 
> 


> N' 


£-l-4fc 


(6.26) 
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For the upper bound, we use 


N^' ^ 

jV ^ _ \ M — 


TV'"" 


and 


1 ^ 
- y 

N ^ 


< 


^ (7 - \Y 


1 ^ 
- y 

N ^ 


< 


-l+3e 


(7 - 


< (3/3)^iV 


?-l+3e 


(6.27) 


< 


AT 

E 


" - A)' - iv (C, - T,)' - iv (?/3JV-=/3)' 

This completes the proof of the lemma. 


< 3 • 


(6.28) 

□ 


In the supercritical case, we have the following lemma corresponding to Lemma 5.4 in the 
subcritical case. 

Lemma 6.3. Assume Conditions 2.3 and 2.4- Let (3 > fdc- Let 7 he the unique solution to the 
equation G'{^) = 0 defined in Lemma 4-1. Then, there exists K = K{N) satisfying N~^ < K < C 
for some constant C > 0 such that 

/ (6.29) 

J 7 —icXD 

with high probability. 

Proof. Define K by the relation (6.29); 


K := -ie-T^M / eT^Ddz. 

J 7 —ioo 


(6.30) 


Then K is real and positive since Z > 0 in (4.7) and ^( 7 ) is real. We now estimate K. 

Fix 0 < e < ^. Since e < we may assume that, as in the previous two lemmas, the eigenvalues 
Aj’s satisfy the rigidity Condition 2.4, Lemma 6.1 and Lemma 6.2 since these event occurs with 
high probability. 

To prove the upper bound for K, we consider 


K = -i 


(7 [G( 2 ) - G(7)i) dz = y” exp (7 (G(^ + i() _ G(^)]) it 

y“expAwt-i^log(^l + 


it 

j-Xj 


(6.31) 


N 


< 


exp 




i=i 


(7-A7^ 


dt 


where the last step was obtained by taking the absolute value of the integrand. Since 7 — Aj < 
7 — Atv EC for some C (with high probability), we find that 


K < 


/ oo / N \ 

exp ( — — log (1 + ) df = / (1 + C't^)“^/^dt < C (6.32) 

■OO \ ‘^ J J —OO 
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for some C > 0, if is sufficiently large. This proves the upper bound for K. 

In order to prove the lower bound for K, we consider the curve of steepest-descent that passes 
through the point 7 in the complex plane. This curve, denoted by T, satisfies lm.G{z) = 0. (Note 
that the real axis also satisfies ImG( 2 :) = 0.) It is easy to check, using the formula (4.8) of G{z), 
that 

(i) r n C+ is a G^ curve, 

(ii) r intersects the real axis only at 7 , 

(iii) the tangent line of T at 7 is parallel to the imaginary axis, and 

(iv) the real axis is the asymptote in the —00 direction. 

Since all solutions of the equation G'{z) = 0 are on the real axis, the function G{z) is decreasing 
along the curve T n C"*" as z moves from the point 7 to the point — 00 . 

We first have 


r'y+ioo 


ef G(.)d^ = 


efGWd^. 


(6.33) 


' 7—100 


This follows by noting that ReG(z) < 2(5^ — log(i?/2), for \z\ = R such that Rez < 7 , and hence, 
if we let Gr be the circular arc \z\ = R such that Re 2 : < 7 , then 




'Cr 


< 


Cn 


R^/4-i 


as R —>• 00 where Gj^ is a constant depending on N. From (6.33), we have 


(6.34) 


K = -i 


-'L 


exp 


N 


[G{z) - G( 7 )] dz. 


(6.35) 


Let 

r+ = rnc+, r- = rnC". (6.36) 

If substitute dz = dx + idy in (6.35), we see that the integral with respect to dx vanishes since 
K > 0. (More precisely, the contribution from exp[(A^/ 2 )(G( 2 ;) — G( 7 )](—dx) on T^ is cancelled by 
the contribution from exp[(N/ 2 )(G( 2 :) — G( 7 )]dx on T".) We thus find, using that G{z) = G{z), 
that 

K = 2 exp [G{z) - G( 7 )]^ dy. (6.37) 

Now let R 7 V -2 be the ball of radius N~‘^ centered at 7 . Since G{z) is analytic for 2 ; such that 
Rez > Ai, G(z) is analytic in R 7 V -2 from Lemma 6.1. Hence for 2 ; G r+ n B^- 2 , 


G{z) - G{-f) 



G0)(7) 

~1^ 


(z-jy. 


(6.38) 


Due to the second part of Lemma 6.2, this power series converges uniformly in Br [-2 for all N > Gq 
where Go is the constant in Lemma 6.2. Recall that ( 7 ) is a real number for all j; more precisely, 
(— 1 )'^G(-^^( 7 ) > 0. Comparing the imaginary parts of the both sides of (6.38), we find that for 
2 G T^ n Bj\f-2, 

0 = G"(7) Re(2 — 7) Imz H-(Re(2 — 7))^ Imz-^(Imz)^ + D ( 6 . 39 ) 

2 ^ ^ 6 
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with 


(6.40) 




j=4 


Note that Im ((z — 7 )-^) divided by Im( 2 : — 7 ) is a polynomial in Re( 2 ; — 7 ) and Imz. Dividing (6.39) 
by Imz and G''{'y), we find that 2; = (X + 7) + iE € r+ n Bpf -2 solves the equation 


where 


and 


F{X, Y) = 0, F{X, Y)-.= X- ^ Y) 

2 6 


a:=-YM>0. 




G"(7) 

G(^)( 7 ) Im((X + iy)J) 


i=4 


j!G"( 7 ) 


y 


Using the general inequality 


I Im ((a + i 6 )-^) I < j|a + i 6 P ^| 6 | 


(6.41) 

(6.42) 

(6.43) 

(6.44) 


for real numbers a, b, for j > 1 , which can be checked easily by an induction and the trivial bound 
I Re ((a + ity) | < |a + i&P, and using the second part of Lemma 6 . 2 , we find that 


°° ni )yj-l+3e 

1^1 ^ E \l-Se ^ 

j=4 

for 2 ; = {X + 7 ) + iy E r+ n Bj^- 2 , for all N > 2Co. From the second part of Lemma 6.2, 

< a < 

Fq 

and since e < i, we have 


(6.45) 


^ 116^^2 ^ < y, < aY 


lUv2 


(6.46) 


(6.47) 


uniformly for 2 ; = (X + 7 ) + iy G B ^-2 where the notation a^r <C b]\f for sequences a^r and 6 jv 
means that ^ ^ 0 as —>• 00 . Hence equation (6.41) becomes 


a. 


and hence 


y(i + o(i)) + ^y2(i + o(i)) = o, 

6 


X = -^YHl + om a = -^>0, 


(6.48) 


(6.49) 


uniformly for 2 ; = (X + 7 ) + iy G r+ n Bj^- 2 . This shows, in particular, that Re 2 ; < 7 for 
2 ; G r+ n Bj^-2 . Moreover, it is direct to check, by proceeding as above, that 


a. 


dxF = i + o{i), ayF = -y(i + o(i)), 


(6.50) 
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uniformly for = (X + 7 ) + iY G Bj^- 2 . (Here, for the estimate of we use the inequality 

I Re ((a + iby — \ < ^ —^|a + i 6 P“^| 6 |^ (6.51) 

for real numbers a, b, for j > 2, which can be checked easily by an induction and the bound (6.44).) 
Therefore, r+ n Bj ^-2 is a graph, and dy = dY is positive on r+ n Bj ^-2 and T'*' intersects dBj ^-2 
at exactly one point. 

Let Z 2 G C"'' be the point where T"*" and dBj ^-2 intersect. Since Ty is a path of steepest-descent, 
ImG(^; 2 ) = ImG( 7 ) = 0 and G{z 2 ) < G( 7 )- From (6.38) and Lemma 6.2, 

1 00 .. .. 

G(7) - G{z2) = \G{z2) - G(7)| > -^N^-^^\z2 - 7l" - E -CIN^-"^^^\z2 - 7 P > (6.52) 


for all large enough N. On the other hand, consider Bj^- 3 , the ball of radius N ^ centered at 7 , 
and let z^ G C"*" be the point where F"*" and dBj ^-3 intersect. Then, by a similar argument, 


G(7) - G{zy = |G(z3) - G(7)| < 2GlN-^+^^ 


for all large enough N where Gq is the constant in Lemma 6.2. 
Next, we claim that, for any decreasing function / : F^ —)• M, 



g/(2)dy > 0. 


(6.53) 


(6.54) 


To check the claim, we parametrize the curve F"*" = F'*'(t), t G [0,1], with r'*'(0) = 7 and r'*'(l) = 
— 00 . Note that lmr'*'(0) = ImF“*’(l). Suppose that ImF“*'(t) increases on (0,fo) and decreases on 
(to, !)• (In particular, it attains its maximum at t = to-) Then, we have 



g/( 2 )dy = f Im(r^)'(t)dt 

Jo 

= / °e^(^^W)ijn(r+)'(t)dt+ [ e^^^^J^hm{r+y{t)dt 

Jo J to 

> [ Im(r“'")'(t)dt + f Im(F“'")'(t)dt 

Jo J to 

= g/(r+do)) f im(F+)'(t)dt = 0. 


(6.55) 


This shows that the claim (6.54) holds in this case. For the general case, suppose that there are real 
numbers 0 < to < ti < • • • < t 2 fc < 1 such that ImF'''(t) increases on (0,to), (ti,t 2 ), • • • , {t 2 k-i,t 2 k) 
and decreases on (to, ti), (t 2 , ta), • • • , {t 2 k, 1)- (From the explicit formula of Im G{z) = 0, it is direct 
to check that F^ approaches to —00 monotonically downward, and hence there are only finitely 
many such real numbers tj’s.) Set 


A 2 p = Imr+(t 2 p) - Imr+(t 2 p-i), A 2 P +1 = ImF+(t 2 p) - Imr+(2p+1), (p = 0,1, • • • ,k) 


with t_i := 0 and t 2 fc+i := 1. Note that A 2 p > 0, A 2 p+i > 0, and 

Ao + A2 + • • • + A2p > Ai + A3 + • • • + A2P+1 
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for any p = 0,1, • • • ,k since r"*" is above the real axis. Applying the same strategy as in the case 
A; = 0, we obtain 


Jo 

> e/(r+(io))(Ao - Ai) + e^(r+(t 2 ))(A 2 - A3) + • • • + - As^+i) 

> - Ai + A2 - As) + - A5) + • • • + - A 

> g/(r haiiDj-AQ _ Ai + A2 — A3 + • • • + A2fe — A2fc+i) > 0. 

This proves the claim (6.54). 

We apply (6.54) to the function 


2fe+lJ 


= I f (Giz 2 ) - G(7)) if z g r+ n 


f (G(z)-G( 7)) ifzG^+n(i?^-2)^ 


and obtain 


'r+ns„_2 


exp ( ^ [^(2:2) - G(7)] ) dy + 


/r+n(s^_2)= 


Now, since dy is positive in r+ n we have the estimate (recall (6.37)) 


-K = / exp 

2 M 


N 


[G{z) - G(7)] dy 


'r+nB^_3 


> 


L 


N 


exp — [<^(23) - G{j)] dy + 


/r+ns^_3 

Subtracting (6.57) from (6.58), we find that 


/ 


exp 

-3)" '' 



fN 

exp 

-3)" 



-K > 
2 “ 


I 


r+nSj^_3 L 

+ / 


N 


exp — [G{z 3 ) - G{j)] - exp — [G{z 2 ) - 0(7)] 


N 


dy 


exp 


> 


T+n(Bj^_3)'=nBj^_2 L 
'N 


N 


[G{z) - G(7)] - exp — [G{z 2 ) - ^(7)] 


N 


(6.56) 


exp { ^ [G{z) - G(7)] ) dy > 0. (6.57) 


exp [G(2;) - G(7)]^ dy + ^ yxp [G(z) - ^(7)]^ dy (6.58) 


dy (6.59) 


/r+nB„_3 L 


exp 


[G{z 3 ) - G(7)] - exp — [G{z 2 ) - G(7)] 


N 


dy. 


Since /p+p^ _3 dy = Imzs > G\z 3 — 7I = GN ^ for some constant C > 0 (see (6.49) and (6.46)), 
we find, using the estimates (6.52) and (6.53), that 

1 


-A > CN 


-3 


exp {-ClN-^+^^) - exp ( --N 


T-2-8e 


> CN 


-5-8e 


This completes the proof of the lemma. 


(6.60) 

□ 
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We now prove Theorem 2.11. 


Proof of Theorem 2.11. From (4.7), (5.25), and Lemma 6.3, we obtain 


(6.61) 


with high probability, where K satisfies N ^ < K < C. Thus, using Lemma 6.2, we find that 


^ log = ]^ [G( 7 ) - 1 - log(2/3)] +0{N ^ log N) 
1 / /■^+ 


/3Ai - /3e(Ai - C+) - ^ ^ log(C+ - z)dv{z) + 1 + log(2/3)) + 


(6.62) 


2/3Ai - 2/3c(Ai - C+) - [ log(C+ - z)dv{z) - 1 - log(2/3) 

JC- 


+ 0{N-^+^^) 


with high probability. Hence, we obtain that 

Fm - F{(3) = (/3 - /3e) (Ai - C+) + 0(iV-i+4^). 


(6.63) 


where F(/3) is defined in (2.16). Now the proof of the theorem follows from the edge universality. 
Condition 2.6. □ 


7 Third order phase transition 

In this section, we prove Theorem 2.9. 

Proof of Theorem 2.9. The convergence of T/v to F{j3) in distribution follows from Theorem 2.10 
and 2.11 since normal distribution and Tracy-Widom distribution have exponential tails. 

We now prove that T(/3) is (7^ but not at /3 = /3c. It suffices to prove the statement for 


1 l + log(2/3) A 


Set 


F(/3) := - { F(/3) + 


/o(/3) = [ log (7 - k)di'{k) 
Jc- 


(7.1) 


(7.2) 


as in (5.30). Recall that 7 = 7 (/ 3 ) is a function of /3 and and it has a limit as /3 ^ Pc'. p{Pc) = C*+ 
(see the discussion after (2.14)). Hence, if we set 


MPc) = [ log(C'+ - k)di^{k), 

Jc- 

then /o(/3) foiPc) as /3 ^ Pc- With this definition, 

^..)^ f7(/3)-^/o(/3), /3</3c, 

\C+- 2^/o(/3c), P>Pc. 


(7.3) 


(7.4) 
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From this it easy to see that 


limdfF(H) = 


foiPc 






3fo(/3c) 
/3c^ ' 


(7.5) 


We now calculate the limits as 13 f3c- Since 7 (/ 3 ) satisfies (see (2.14)) 



r -28 

Jc. P{P)-k 

(7.6) 

we find that 

df^hiP) = [dpi) = {2P)dpp. 

J c _ ^ 

(7.7) 

Thus, 

0 F/m 0 fo{P)\ foiP) 

a,FW) = a, (yw j = 2^52 . 

(7.8) 

and hence 

lim dBF{(3) = = lim dRF{f3). 

(7.9) 

Differentiating (7.8) again, we obtain 



n2F.m_^/37 UP) 

(7.10) 


We claim that 9^7 Q as j3 13c- This can be checked by differentiating (7.6), 



(7.11) 


and noting that the integral tends to oo as /3 ^ /3c due to the square root decay of 


Therefore, 




Finally, after differentiating (7.10), 


dp{l3) 


35 _ ^ 


Since ( 9^7 0 as P f3c, 


lim d^sF(/3) = 

0F0C. ^ 


lim 


/3/'A 




Pc 


'£l 3/o(/3c 


Pt 


(7.12) 


(7.13) 


(7.14) 


Comparing with (7.5), it suffices to show that lim d'^p 7 ^ 0 to prove the theorem. Differentiating 
(7.11) once more, we obtain 


rC+ 

-(5|7) / 

JC- 


dvpk) 

L 


+ 2 ( 5 ^ 7 )' 


rC+ 

"L 


diy{k) 

(7 — k)^ 


= 0 . 


(7.15) 
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Since ^ has the square root decay, we have 


dpi 


r^+ du{k) 

Jc- {i-ky 


Jc- 


dv{k) 

c_ (7-A:)3 


(7-C+)-3/2, 


(7.16) 


as P Pc- We thus conclude from (7.15) that 5^7 ~ 1. This completes the proof of the desired 
theorem. □ 


A Appendix 


In this appendix, we evaluate various constants stated in Section 3. As a main tool of the evaluation, 
we use the Stieltjes transform of the measure z/, defined by 


m[z) = 


rC+ 


dv{x) 


z G C \ [C_,C'+). 


/c_ X- z 

Note that due to the square root decay of at x = (7+, m(C'+) is well-defined. 


(A.l) 


A.l Wigner matrix 

We only consider real Wigner matrices. The complex case can be evaluated by the same way and 
we skip the details. The limiting spectral measure for real Wigner matrices is 4 — 

—2 < X < 2. Hence 

/ N —-2 + — 4 ,. , 

m{z) = ---, (A. 2 ) 


and we find that 


^ 1 f^+ diz(x) . , , . 

^ Jc- C+ - X 


(A.3) 


We now evaluate 7 defined in (2.14). This equation is equivalent to the equation = 2/3. 

Solving this equation, we find that 

7 = 2/3 + ^. (A.4) 

To evaluate (2.15) and (2.16), we note that 

/: 

for 2 : G C \ (— 00 , 2 ]. This follows by noting that 


log(z — x)di^(x) = “ \/z"^ — + log (^z -b — 4^ — log 2 — 


(A.5) 


—m{z) = 

dz 


-2 (^z — \/z'^ — 4^ -b log (^z + y/z'^ — dj 

and log( 2 ; — x)di^(x) = logz -b 0{z~^) as z ^ 00 . Thus, using (A.4), we obtain 

if/3<l/2 


(A. 6 ) 


m = 


2^-}£sm±yi if/3>i/2. 


(A.7) 
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We now evaluate £(/3) and (T^(^) for the case when /3 < f3c- First, since 


f 


dvix) . 1 z 

{z-xY ^ ^ 2 


zeC\[-2,2], 


(2.18) becomes 


4(/3) = log(2/3) - i log (m'(7(/3))) = ^ log(l - 4/3^) 


Second, we evaluate (see (3.3)) tY'-p) = ^ = 0; 1; 2, and 4, where 


t^z) = - log(z - x) 


Ti{x/2) 


vr 


'-2 


\/4: — 




2 ; e C \ (— 00 ,2), 


We start with evaluating 


1 


Ti{x/2) 


4(^) = - ! ! . Pa - 2 

vr J _2 (z — x)v4 — x^ 


dx. 


(A.8) 

(A.9) 

(A.IO) 

(A.ll) 


Recall the recursions Ti+ 2 {y) = 2yT£+i(y)—T£(y), i>0, and the orthogonality relation f}-. ^^I^ dy 
0 for all integers 0 < k < i, for the Chebyshev polynomials. From this we obtain the recursions 

4+2(^) = K+ii^) - (A.12) 

for £ > 0. Since To(x/2) = 1 and Ti(x/2) = x/2, it is easy to check from residue calculus that 

1 , , , 2 : 1 




A{z) = 


2^12^74 2' 


Hence 


t'2{z) = 


-2 


z ,, . 2 ^ - 42;2 + 2 z^ 

- , -, tA(z} = - , - h Z. 

2Vz^ - 4 2 2Vz^ - 4 2 


(A.13) 


(A.14) 


Taking the anti-derivaties and noting from (A.IO) that to(z) = \ogz + 0{\/z) and ti{z) = 0(1/2:), 
£ > 1, as 2 —)• 00 using the orthogonality relations of the Chebyshev polynomials, we find that 


to{z) = log (^z + \/z‘^ - log 2, ti{z) = ^y /^2 _ 4 _ 


(A.15) 


and 


*2(2) = |v7vr:4-^ + i = + (a.m) 


Hence, 


'ro{p) =-log{2(3), Ti(yv) = -2/3, T2{p) =-20^, r4(yv) =(A.17) 

Third, VcoEif) (see (3.7)) is evaluated from the lemma A.l below. Hence we find that 


Vgoe{p) = ^72(7(/3),7(/3)) = -21og(l - 4/3^) (A.18) 

where F{z,w) is defined in (A.20). We also have (see (3.6)) 

Mgoe{f) = ^ log(l - 4/3^)- (A.19) 

Therefore, combining with (A.9), (A.17), we obtain (3.12) and (3.13). 

It remains to prove following Lemma. 
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Lemma A.l. Set 

f2 r2 


L{z,w)= / {log{z - x)-log{z - y)) {log{w - x)-\og{w - y))Q{x,y)dxdy, 

J-2J-2 

for z,w G C \ (— 00 ,2 ], where 

A — xy 


Qix,y) = 


{x — yYVA: — x‘^\/ A — y^ 


Then 


L{z, w) = 27r^ log 


{z + R{z)){w + R{w)) 


2{zw — 4 + R{z)R{w)) 
for z,w gC \ (— 00 , 2 ], where R{z) is defined with branch cut [—2,2]. 
Proof of Lemma A.l. Consider the second derivative 


R{z) = — 4, 


(A.20) 

(A.21) 

(A.22) 


L,,,,(z,w) = 


1-2 


1 


1 


1 


1 


A — xy 


fdy 


dx. 


_ 2 \z-x z-yj\w-x - yy (x - y)2V4^^y^4^^ 

^ (A.23) 

Recall that R{x) = y/x"^ — A is defined with the branch cut [—2,2]. Using this, it is easy to check 
that 


^ J 


1 


1 


1 


1 


z — x z — yj \w — x w — yj {x — yYR{x)R{y) 


A — xy 


-Ay 


dx (A.24) 


where S and S' are simple closed contours, oriented positively, that contain the interval [—2, 2] 
in its interior and the points 2 and w in its exterior, and S' lies in the interior of S. By residue 
Calculus, we obtain 

—27r^ A — zw 

L'zw (- 2 ^? — 


(z — wY 


1 + 


R{z)R{w) 


(A.25) 


Integrating with respect to z, 


-2t: [R{z) - R{w)) 

LAz,w) = - / ^ p - + Cl (w) 


{z — w)R{w) 

for some function Ci{w) of w. The function Ci{w) is determined by noting that, from (A.26), 


(A.26) 


Ly,{z,w) = 

as z ^ 00 with fixed w, while from (A.20) 

f 2 /■2 


-27r 

R{w) 


+ Ci{w) + 0{z ^) 


(A.27) 


L^{z,w) = L{z,w) = [ [ {log{z - x) - log{z - y)) (— - -—'] Q{x,y)dxdy, (A.28) 

. 1-2 J -2 \w-x w-yj 


X w — y ^ 

is 0{z~^). Hence Ci{w) = -^Y)- Integrating with respect to w, we find that 


L(z, w) = 27r^ log 


{w + R{w)) 


PC2iz) 


(A.29) 


2{zw — 4 + R{z)R{w)) 

for some function C 2 {z) of z. By considering the asymptotics as w ^ 00 , we find that C 2 {z) = 
2 A [log(z + R{z)) — logic], and this completes the proof of Lemma. □ 
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A.2 Sample covariance matrix 


We again only consider real case here. Complex case is similar. The Stieltjes transform of the 
Marchenko-Pastur distribution du (3.25) is given by 


( \ —Z + + ^/{z — c^){z — C_) ^ rr\ ^1 

my{z) = ---, z€C\\C-,C+\. 


2 z 


(A.30) 


Here Cj^ = {y/d + 1)^ and C- = {y/d — 1)^. Hence, for real sample covariance matrices, we have 

1 f^+ di^(x) my{C+) 1 


/3c = 


-f 

•iJc- 


IC- C+— X 2 2{y/d + l) 

In the high temperature case, /3 < /3c, we get from the dehnition of 7 that 

-my{j) = 2/3. 


(A.31) 


(A.32) 


Solving the equation, we find that 

d 1 
' 1-2/3 2/3 

We note that 7 is a decreasing function in /3 and satishes 7 > (7+ for 0 < /3 < /3c. 
evaluate T(/3) in (2.15), note that 


(A.33) 
In order to 


T /_^ = (^) ^ /_^ - A)dUA) 

/d 7 \ . / 2 d 1 \ ^ ^ 

using (A.32) and (A.33). Hence, by taking the antiderivative, 

J ^ log( 7 (/ 3 ) - X)du{\) = + dlog(l - 2/3) - log(2/3), (A.35) 


where the constant of integration is determined by the asymptotics that the left hand side behaves 
as 

log 7 + 0 ( 7 "^) = - log(2/3) + 0(/3) 

as 7 —)■ 00 , or /3 ^ 0. Hence 

^(/3) = -^log(l-2/3) 

for /3 < /3c. 

The limit of the free energy per particle L{/3) for /3 > f3c is obtained easily using (A.35) and 
noting that (7+ = 7(/3c). 

We now evaluate I and for /3 < /3c. Prom (2.18), we find that 

h = log(2/3) - Y log(mU7)) = Y - 4^^) (^-36) 


where we set 


B = 


/3y/d 

1-2/3' 


(A.37) 
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On the other hand, in other to evaluate M{ip) and V(p) for p(x) = log(j—x), we observe that (3.26) 
implies that 

= log l^2B + 

where B is given above. Note that 'ip{x) is the same function as p{x) = log (7 — x) for Wiger 
matrices with the change that j3 is replaced by B. Moreover, we have ti{c + /) = Te{f) for £ > 0 
and To(c + /) = c + to(/) for a constant c and function /, from the definition of T£ and the 
orthogonality of Chebyshev polynomials. Prom this and the results obtained for Wigner matrices, 
we can easily find M{p) and V{p) for sample covariance matrices. 


^{x) = log{Vd)+ 'ip{x), 'ip{x) 


2B 


— X 


(A.38) 
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